Using Unlabelled Data To Update Classification Rules With Applications In Food Authenticity Studies
نویسندگان
چکیده
A classification method is developed to classify samples when both labelled and unlabelled samples are available. The classification rule is estimated using both the labelled and unlabelled data, in contrast to many classical methods which only use the labelled data for estimation. This methodology models the data as arising from a Gaussian mixture model with parsimonious covariance structure, as is done in model-based clustering (Fraley and Raftery (2002)). A missing-data formulation of the mixture model is used and the models are fitted using the EM and CEM algorithms. A comparison of the performance of model-based discriminant analysis and the proposed method of classification is given. The methods are applied to the analysis of spectra of foodstuffs recorded over the visible and near-infrared wavelength range in food authenticity studies. The aim of this study is to classify the foodstuffs using their spectra. The proposed classification method is shown to yield very good misclassification rates. The correct classification rate was observed to be as much as 15% higher than the correct classification rate for model-based discriminant analysis.
منابع مشابه
Variable Selection and Updating in Model-based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications by Thomas
Food authenticity studies are concerned with determining if food samples have been correctly labeled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-superv...
متن کاملVariable Selection and Updating In Model-Based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications.
Food authenticity studies are concerned with determining if food samples have been correctly labelled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-super...
متن کاملDetecting Concept Drift in Data Stream Using Semi-Supervised Classification
Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...
متن کاملAutomatic Interpretation of UltraCam Imagery by Combination of Support Vector Machine and Knowledge-based Systems
With the development of digital sensors, an increasing number of high-resolution images are available. Interpretation of these images is not possible manually, which necessitates seeking for practical, fast and automatic solutions to solve the environmental and location-based management problems. The land cover classification using high-resolution imagery is a difficult process because of the c...
متن کاملThe influence of event attributes on tourist’s loyalty: Evidence from the Ashoura event in Yazd City
Many studies have found that the perceived authenticity of cultural and religious events affects event satisfaction and loyalty. Little is currently known about how perceived authenticity is affected by the facilities, such as food and the availability of information, which are independent determinants of satisfaction and loyalty. This study aims to examine the antecedents of event loyalty. Que...
متن کامل